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Abstract 

We establish the eventual periodicity of the spectrum of 
any monadic second-order formula where 

(i) all relation symbols, except equality, are unary, and 

(ii) there is only one function symbol and that symbol is 
unary. 



1 Introduction 

Durand, Fagin and Loescher established the eventual pe- 
riodicity of the spectrum of any monadic first-order for- 
mula with one unary function symbol [3 1. (They also dis- 
cuss the cases of two unary function symbols and one bi- 
nary function symbol; in either of those cases the spectra 
are more complicated.) In this paper, we are interested in 
monadic second-order logic (MSO). 

Let us recall the relevant definitions. The spectrum of a 
first-order or second-order formula is the set of the cardi- 
nalities of its finite models. A set S of natural numbers 
is eventually periodic if there exist natural numbers 9 and 
p > such that for every n > 9, if S contains n then it 
contains n + p. 

Monadic first-order formulas with one unary function 
symbol are first-order formulas subject to the following 
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two restrictions: 

• All predicate symbols, with the exception of equal- 
ity, are unary. 

• There is only one function symbol. It is unary. 

MSO formulas with one unary function symbol are defined 
similarly. 

Upon learning the result of Durand, Fagin and Loescher, 
we noticed that the monadic second-order composition 
method gives the following generalization. 

Theorem 1.1 (Main Theorem) The spectrum of any 
monadic second-order formula with one unary function 
symbol is eventually periodic. 

Durand, Fagin and Loescher kindly mentioned our gener- 
alization at the end of their paper. For no reason, a proof 
sketch of the generalization was lying idle all these years. 
In this paper we prove the Main Theorem. Additional re- 
sults will appear in 1 17 1. We give also a direct proof of the 
decidability of the finite satisfiability of MSO formulas 
with one unary function symbol. The fact of decidability 
is known |T2|[T1. 

The paper is organized as follows. Section |2]presents the 
basics of the composition method. Sections [5H6]prepare 
the ground for the proof of the Main Theorem; the proof 
itself is given in Sectional Specifically, Sections [5] gives 



simple facts on eventual periodicity, Section0]introduces 
structures of relevance to the proof of the Main Theorem, 
Section|5]reduces the Main Theorem to a similar theorem 
where the role of formulas is played by special finite frag- 
ments of the theories of function graphs, and Section [6] 
introduces relevant operations on structures of relevance. 
The direct decidability result is proven in Section[8] 



2 The Composition Method 

Under certain circumstances, a composition of structures 
gives rise to a composition of their appropriately defined 
types. That observation lies in the heart of the composi- 
tion method. We explain a simple version of the method 
that goes a long way and that is sufficient for our pur- 
poses in this paper. At the end, we mention some further 
developments. Lower your expectation of the feasibility 
of algorithms. 

Let L be a purely relational language of finite vocabulary, 
X, Y be models for L, and xx, x%, . . . (resp. y%, y%, ■ ■ .) 
be elements of X (resp. elements of Y). We write 
(fi(vi, . . . , Vj) to indicate that p is a formula with free 
variables among V\ , . . . , Vj. 

Definition 2.1 The 0-theory th°(X, xx, ■ ■ • , xj) is 
the set of atomic formulas ip(vx, . . . , Vj) such that 
X |= (p(x\, . . . ,Xj), and the (d + l)-theory 
th d+1 (X, x\, . . . , Xj) is the set of d-theories 
th d {X, xx, ■ ■ ■ , Xj, Xj + i) where Xj + x ranges over 
the elements of X. □ 

Let Atom(j) be the set of atomic formulas with variables 
among Vx, ■ ■ ■ , Vj, Box°(j) be the powerset of Atom(j), 
and Box d+1 (j) be the powerset of Box d (j + 1). Every 
Box rf (j) is hereditarily finite. 

Lemma 2.2 Every d-theory th d (X, xx, ■ ■ ■ , Xj) belongs 
toBox d (j). 

Proof Induction on d. □ 



It follows that every th d (X, x\ , . . . , Xj) is finite. A bound 
on the cardinality of th d (X computable 
from d and j. 

The d-theory th d (X, X 1 j ■ ■ . , Xj ) IS closely associated 
with the set of formulas ip(vx, ■ ■ ■ ,Vj) of depth d such 
that X \= (p(xx, ■ ■ ■ , xj). 

Lemma 2.3 For every t £ Box d (j), there is a formula 
St (vx , . . . ,Vj) of quantifier depth d such that the follow- 
ing holds for all X and all Xx, ■ ■ ■ , Xj. 

1. th d (X, xx, ■ ■ ■ , Xj) = t if and only if X \= 

E t (xx, ■ ■ ■ ,Xj). 

2. If th d (X, X\ , . . . , Xj) — t then, for every formula 
ip = ip{vi , . . . ,Vj) of quantifier depth < d, we have 

• if X ^ <f{xx, ■ ■ ■ ,Xj) then e t implies ip, and 

• if X \= -i(p(x\ , . . . , Xj) then St implies ->tp. 

Proof Both claims are proved by induction on d, and in 
both cases the cases d = is obvious. Let d = c + 1 > 0. 

1. The desired £t(vx, ■ ■ ■ , Vj) is the conjunction of for- 
mulas 3vj + xE s (vi, ■ ■ ■ , Vj+x) where set and formulas 
-i3uj + i£ s (ui, . . . ,Vj + i) where s 6 Box c (j + 1) — t. 

2. Assume that th d (X, x\, . . . , Xj) = t. Every 
formula ip = ip(vx, ■ ■ ■ ,vj) of quantifier depth < 
d is equivalent to a Boolean combination of formu- 
las 3vj+x<Pi(vi, ■ ■ ■ ,Vj+x) where the quantifier depth 
of every pi is < c. Therefore it suffices to prove 
the second claim for the case when tp has the form 
3v j+ xi^(vx, ■ ■ -,Vj + x)- 

First suppose that X \= ip(xx, ■ ■ ■ ,Xj). Then there 
exists Xj + x such that X \= ip(xx, ■ ■ ■ , Xj+i- Let 
s = th c (X, xx, ■ ■ ■ , Xj+i). Clearly, s e t. By 
the induction hypothesis, e s implies ip. It follows that 
3vj+ie s (vx, ■ ■ ■ ,Vj+i) implies ip, and therefore e t im- 
plies p. 

Second suppose that X \= -i(p(xx, ■ ■ ■ ,Xj). By contra- 
diction suppose that e t is consistent with p. Then there 
exist Y and yx, ■ . . , Vj+x such that Y \= s t (yx, •••,%') A 



*Kz/i> • • • j Vj+i)- B y the fi rst claim , th d (Y, y 1 ,...,y j ) = 
t, and so the c-theory s = th c (Y, yi, . . . , yj+i) belongs 
to t. By the induction hypothesis, £ s implies ip. Hence 
3vj + ie s implies <p. Since s E t = th d (X,xi, . . . ,Xj), 
we have X (= e a (a;i, • • • , Xj,Xj+i) for some Xj+\. 
Hence X \= (p(x%, . . . , Xj) which is impossible. □ 

Corollary 2.4 There is an algorithm that, given a d- 
theory th d (X) and a sentence ip of quantifier depth d de- 
cides whether X \= ip. 

Corollary 2.5 For every sentence ip of quantifier depth 
d, there are d-theories t\, . . . ,tk such that any X j= tp if 
and only ifth d (X) E {ii, . . . , 

There is a close connection between finite theories and 
Ehrenfeucht-Fraisse games. 

Lemma 2.6 The following are equivalent: 

1. th d (X, Xl , . . . , Xj ) = th d (Y,y u . . . , yj ), 

2. the duplicator has a winning strategy in 
EF d ((X,x 1 ,...,x j ),(Y,y 1 ,...,y j )). 

Proof Induction on d. The case d — is obvious. Sup- 
pose that d = c + 1 > 0. 

1— >2. Without loss of generality, on the first step, the 
spoiler chooses some element Xj+i in X. The c-theory 
th°(x\, . . . , Xj+i) belongs to th d (X, x\, . . . , xf) and 
therefore to th d (Y, yi, . . . , yf). So the duplicator can find 
an element yj+i E Y such that th c (X, x\,..., Xj+i) = 
th c (Y, j/i, ... , j/j+i). In the rest of the game, the duplica- 
tor uses the winning strategy guaranteed by the induction 
hypothesis. 

2— *7. By the virtue of symmetry, it suffices to 
prove that every member th c (X,xi, . . . ,xj + i) 
of th (X, Xi, . . . , Xj) equals to some member 
th c (Y, yi ,...,y j+1 )) of th d (Y, yi ,..., yj )). Con- 
sider a play where the spoiler starts by choosing an 
element Xj+i in X and the duplicator chooses some yj+i 



in Y such that he has a winning strategy in the remain- 
der game EF d ((X,xi, . . . ,x j+1 ),(Y, yi , . . . ,y j+1 )). 
By the induction hypothesis, th c (X, x\, . . . , %*+i) = 
th c (Y, yi ,...,y j+1 )). ' □ 

To illustrate this most basic version of the composition 
method, we give a simple example. Later in this paper 
we define a finite forest as a finite acyclic directed graph, 
where every vertex has at most one outgoing edge, to- 
gether with a unary relation Orphan that consists of the 
vertices without outgoing edges, the orphans. (Actually 
we will use finite forests that may be endowed with addi- 
tional unary relations, but for now let's forgo that expan- 
sion.) 

Every finite forest is obtained from a singleton forest by 
means of the disjoint union operation X+Y and the unary 
operation X' that attaches a root to the given forest X and 
thus turns it into a tree. 

Lemma 2.7 

1. The d-theory ofth d (X + Y) is uniquely determined 
by the d-theories th d (X) and th d (Y). 

2. The d-theory of th d {X') is uniquely determined by 
the d-theory th d (X). 

Proof Use Ehrenfeucht-Fraisse games. □ 

This lemma can be called the first-order composition 
lemma for finite forests. It allows us to define the oper- 
ation th d {X) + th d (Y) = th d {X + Y) and (th d {X))' = 
th d (X') on d-theories. Both operations on c?-theories are 
computable. 

Proposition 2.8 The first-order theory of finite forests is 
decidable. 

Proof By Corollary |2.4l it suffices to show that the set 
Sd of the d-theories of finite forests is computable. Every 
member of Sd is obtained from the d-theory of the single- 
ton forest by means of the two operations on d-theories. 
Since Sd is finite, this allows us to compute 5^. □ 



Now we turn to MSO logic. It is convenient to represent 
second-order structures as special first-order structures. 

Definition 2.9 The Boolean associate 93 (X) of a struc- 
ture X is the atomic Boolean algebra whose atoms are the 
element of X together with 

• the relations of X as relations of 23 (X), and 

• an additional unary relation Atomic that consists of 
the atoms. □ 

Definition 2.10 The MSO d-theory Th d {X) is the first- 
order d-theory th d (<£>(X)). □ 

Remark 2.11 The definition of first-order d- theories 
above had a prerequisite: the first-order language is purely 
relational. The point of the prerequisite is to ensure 
that every Box°(j) is finite. In the presence of func- 
tion symbols, for sufficiently large j, we have infinitely 
many terms with variables among vt,...,Vj and there- 
fore Box d (j) is infinite. The prerequisite can be waved 
for Boolean associates because, in the case of Boolean 
algebras, we have only finitely many inequivalent terms. 
But then the definition of bx d (j) should be modified so 
that only terms in an appropriate normal form are used. □ 

Our example generalizes to MSO. 

Lemma 2.12 

1. Th d (X + Y) is uniquely determined by Th d (X) and 
Th d {Y). 

2. Th d (X') is uniquely determined by Th d {X). 

This lemma can be called the MSO composition lemma 
for finite forests. It allows us to define the operation 
Th d (X) + Th d {Y) = Th d (X + Y) and (Th d {X))' = 
Th d {X') on d-theories. Both operations on MSO d- 
theories are computable. 

Proposition l2.8l and its proof generalize to the MSO case 
but we give an alternative proof that bounds the size of the 
minimal forest with a given MSO e?-theory. 



Proposition 2.13 There is a computable function F(d) 
such that, for every d and every finite forest X, there is a 
forest Y of cardinality < F{d) with Th d (X) = Th d (Y). 

Proof Given a number d, build a finite sequence S = 
(X\,X2, . . .) of finite forests as follows. Xi is the sin- 
gleton forest. Suppose that X\ , . . . , X^ have been con- 
structed and let i,j range over {1, . . . , k}. If there is 
a finite forest Y of the form X[ or Xi + Xj such that 
Th d {Y) differs from any Th d (X l ), choose any such Y 
and set Xk+i = Y; otherwise halt. 

The sequence S contains every MSO the <i-theory of every 
finite forest. The length of S is bounded by the cardinality 
of the Box d (0) for the first-order language of the Boolean 
associates of finite forests. The cardinality of any Xk+i is 
bounded by the double of the maximal cardinality of the 
forests inXx, . . . , Xf.. This gives the desired F(d). □ 

The proposition implies that the MSO theory of finite 
forests is decidable. The decision algorithm is non- 
elementary, but there is no elementary algorithm for that 
decision problem. This follows from Stockmeyer's re- 
sult that the decision problem for the first-order theory of 
words is known to be non-elementary [S74,R02]. Words 
can be seen as special trees with additional unary rela- 
tions. The order relation (on the positions) which is a 
given in Stockmeyer's theorem is MSO definable from the 
successor relation which is a given in our case. 

Remark 2.14 One subtlety of the composition method is 
that it is sensitive to the precise choice of language. We 
explain this on an the example of finite forests where, to 
simplify notation, we ignore the difference between sub- 
sets of a forest X and the elements of < B(X). 

We mentioned above that Th d (X') is determined by 
Th d {X). Let r be the root of X', let R be the single- 
ton forest containing r, let Ax, ■ ■ ■ , Aj be arbitrary sub- 
sets of X, let Bi = Ai n {r}, and let Q = A4 - B t . 
More generally, Th d (X' , Ai, . . . , Af) is determined by 
Th d (R, B 1} ..., Bj) and Th d (X, C x , . ■ ■ , C s ). But this 
claim of unique determination fails if we abandon the 
unary relation Orphan which might have seemed to play 
no role until now. Orphan is needed already in the case 
d = and j > 2. Indeed, let x be any element of X, 



A\ = B\ = {r} and A2 = C2 = {x}. In order to de- 
termine whether there is an edge from x to r, we need to 
know whether x is an orphan in X. 

This little exercise shows also another subtlety of the 
method. Notice how neatly each set Ai split into subsets 
Bi and Cj. It would be less convenient to work the first- 
order d-theories th d (X, x±, . . . , Xj) instead of second- 
order d-theories Th d (X, A\, . . . , Aj). Elements are not 
splittable. The method fits MSO better than first-order 
logic. □ 

The origins of the composition method can be traced back 
to the Feferman-Vaught article on "the first-order proper- 
ties of products of algebraic systems" (4)- Lauchli intro- 
duced d-theories in the context of weak monadic second- 
order logic; he proved that the weak MSO theory of linear 
order is decidable [ 1 1 1. (The weak MSO is the version of 
MSO where second-order quantification is restricted to fi- 
nite sets.) 

Shelah generalized the method to full MSO and used it 
in particular to prove in a uniform way all known decid- 
ability results for the MSO theories of various classes of 
linear orders 1151 . He introduced (k±, . . . , fcd)-theories 
comprising sentences whose prenex form has d blocks 
of quantifiers: k\ quantifiers of one kind (say, existential 
quantifiers), followed by &2 quantifiers of the other kind, 
followed by k% quantifiers of the first kind, and so on. A 
d-theory is a (fc 1; . . . , ^-theory where every ki = 1. In 
the theory of linear orders, the main composition lemma is 
about the addition (that is concatenation) of linear orders. 
In 1 15 1, an important role was played by two generaliza- 
tions of the classical Ramsey theorem (one for dense or- 
ders and another for uncountable well orderings) that take 
advantage of the fact that, for each d, the d-theories of 
linear orders together with the derived addition operation 
form a semigroup. 

MSO logic is arguably the right paradigm for the compo- 
sition method. Composition theorems reduce the theory 
of a composition of structures to a composition of their 
finite theories, but — even in the case of first-order com- 
position theorems — the reduction depends on the MSO 
theory of the index structure. MSO composition theorems 
reduce the MSO (fci, . . . , fc^-theory f a composition of 
structures to some MSO (£1, . . . , ^-theory of the index 



structure with disjoint unary relations t where t(i) means 
that t is the (k\, . . . , fc^-theory of the i th component. 

The authors used the method on numerous occasions. Ar- 
ticle (5) lays a technical foundation for more advanced 
applications of the method. Section 3 of the survey |6| 
describes the composition method (calling it the model- 
theoretic decidability technique) and Section 5 mentions 
various applications of the method. See also the disserta- 
tion 1 20 1 and exposition 1191 . A recent sophisticated use 
of the composition method over finite structues is found 
in||T6l. 

The power of the method is under-appreciated. Through- 
out the years we saw various problems that could be 
solved by the composition method. This claim was put to 
the test a couple of years ago when Alexander Rabinovich 
posed a conjecture to the first coauthor who insisted that 
the method will confirm or refute the conjecture. The con- 
jecture was confirmed Q, and Rabinovich went on to use 
the method [13 1. 

The present paper uses the simple form of the composition 
method explained and exemplified above. 

Remark 2.15 One of the reviewers asked how does the 
composition method compare with the automata-theoretic 
method. The application domains of the two methods in- 
tersect. For example, the decidability of SIS, established 
first by means of Biichi automata |2|, has also a simple 
model-theoretic proof [15 6|. Rabin used automata to 
prove the decidability of the MSO theory, known as S2S, 
of the infinite binary tree 1121 111. It is not clear whether 
the composition theory can be used for the purpose. One 
of the consequences of Rabin's result is the decidability 
of the MSO theory of rational order. This consequence 
was proved, more directly and naturally, by the composi- 
tion method [18|. This alternative proof generalizes to a 
class of dense linear orders (so-called short modest linear 
orders) [ 8 1 to which the automata method does not seem 
to apply. At least on one occasion, the two methods were 
used in a complimentary way |9|. The automata method 
has been used much more for establishing complexity re- 
sults. Lemma l8~Tl is one modest example of the use of the 
composition method for that purpose. □ 

The composition method is inherently model-theoretic, 



like that of Ehrenfeucht-Frai'sse; it may be a useful ad- 
dition to your toolbox. 



3 Eventually Periodic Sets 

Definition 3.1 A set S of natural numbers is eventually 
periodic if there exist natural numbers p > (a period of 
S) and 9 (a p-threshold for S) such that n G 5 implies n+ 
p G S for all n > 9. □ 

The definition of eventual periodicity in 1 3 1 is similar ex- 
cept that "implies" is replaced with "is equivalent to". The 
following lemma shows that this alteration makes no real 
difference. 

Lemma 3.2 Assume that a set S is eventually periodic 
with a period p and a p-threshold 9. There exists a natural 
number 9' such that 

bGS is equivalent to n + p G S for all n > 9' . 

The number 9' could be called a strict p-threshold for S. 

Proof The lemma is trivial if S is finite, so we assume 
that S is infinite. For each i = 0, . . . , p — 1, let 

Ai = {n G S : n > 9 and n = i mod p}. 

The desired strict p-threshold 9' = max{min(Ai) : Ai ^ 
0}. □ 

Corollary 3.3 

Every eventually periodic set S has a least period, and 
the least period divides any other period of S. 

Proof It suffices to prove that the greatest common di- 
visor p of periods p\ , pa is a period. Let 9i be a strict 
Pi-threshold and 9 = max(#i, 6*2). We show that 9 is a 
p-threshold for S. 

Since p = gcd(pi,p2), there exist integers 01,02 such 
that aipi + a 2 p2 = P- Without loss of generality, a\ > 
and 02 < 0. Suppose that n > 9 and n E S. Since 



9\ is a pi-threshold, rt + a>\pi G 5. Since 9-2 is a strict 
p 2 -threshold, n + p = (n + a\p\) — |a 2 | • p 2 G S. □ 

Recall that an arithmetic progression is a set of integers of 
the form 

{b + jp: j =0,1,...}. 

where p > 0. 

Lemma 3.4 (|3|) A set of natural numbers is eventually 
periodic if and only if it is a finite union of arithmetic 
progressions and singleton sets. 

Corollary 3.5 The union of finitely many eventually peri- 
odic sets is eventually periodic. 

Lemma 3.6 If the sets Si, ... , S m are eventually peri- 
odic then so is the set 

{fii H Vn m : every n, G Si} 

Proof Suppose that Si is eventually periodic with pe- 
riod pi and pi-threshold 9i, and let p be the least common 
multiple of pi, . . . ,p m . Then S is eventually periodic 
with period p and a p-threshold 9 = 6\ + ■ ■ ■ 6 m . Indeed 
suppose that n G S and n > 9. Then n = ni + • • • + n m 
where rii G Si. There is an index i such that rii> 9i. We 
have 

n+p = niH hn. 1 -i + (n 4 +p)+n. 1 +iH Vn m G S. □ 



4 Structures of Interest 

In the rest of the paper, all structures are finite. It is often 
convenient to view unary relations as sets. 

Definition 4.1 A partial-function graph is a directed 
graph where every vertex has at most one outgoing edge 
together with the unary relation Orphan that consists 
of the vertices without outgoing edges. The expres- 
sion "partial-function graph" may be abbreviated to "PF- 
graph". □ 



The relation Orphan is added for technical reasons; we 
saw already in Remark l2. 141 that the composition method 
is sensitive to the language. Orphan allows us to express 
the absence of orphans in a quantifier-free way: Orphan = 
0. 

Definition 4.2 A function graph is a PF-graph where ev- 
ery vertex has exactly one outgoing edge. □ 

The edge relation of a PF-graph will be denoted by E. If 
E(x, y) holds (so that there is an edge from x to y), then 
y is the parent of x, and x is a child of y. Children of the 
same parent are siblings. A vertex without a parent is an 
orphan. 

A nonempty sequence xq , . . . , x n of vertices such that 
E(xi, Xi+i) holds for every i — 0, . . . , n — 1 is a path 
of length nfrom xq to x n . A path is trivial if its length is 
zero; otherwise the path is proper. If there is a path from 
x to y then a; is a descendant of y and y is an ancestor of 
x. If there is a proper path from x to y then a; is a proper 
descendant of y and y is a proper ancestor of x. 

An n-vertex substructure of a PF-graph is a cycle if the n 
vertices form a proper path xq, . . . , x n -i, xq- A vertex is 
cyclic if it belongs to a cycle. A PF-graph without a cycle 
is acyclic. 

A colored PF-graph X is a PF-graph together with a finite 
collection of unary relations, the colors. The set of the 
names of the unary relations is the palette of X. It is 
convenient to think of colors as sets of vertices. 

Proviso 4.3 In the rest of this paper, a PF-graph is al- 
ways finite and colored. 

A PF-graph is connected if every two vertices have a com- 
mon ancestor. A connected PF-graph has at most one or- 
phan. 

Definition 4.4 A tree is a connected PF-graph with an or- 
phan; the orphan is the root of the tree. □ 

A connected function graph is a connected PF-graph with- 
out an orphan. It has a unique cycle and is formed by the 
cyclic vertices and their descendants. 



In any PF-graph, the relation "x and y have a common 
ancestor" is an equivalence relation. It partitions the PF- 
graph into connected components. 

Definition 4.5 A forest is an acyclic PF-graph. □ 

A PF-graph X is a forest if and only if every component 
of X is a tree if and only if every component of X has an 
orphan. A PF-graph X is a function graph if and only if 
no component of it has an orphan. 



5 Finite Theories 

An MSO PF-graph formula \ is an MSO formula in the 
vocabulary of PF-graphs. The spectrum of \ is me set of 
the cardinalities of PF-graphs satisfying \- 

Lemma 5.1 For every MSO formula ip with one unary 
function symbol, there exists an MSO PF-graph formula 
X such that ( i) every PF-graph satisfying x is a function 
graph and ( ii) the spectrum of tp equals the spectrum of 
X- 

Proof We start with a construction. First, replace in <p 
the unary function with its graph. For example, a sub- 
formula f(x) = y becomes E(x, y), and a subformula 
/(/(«)) = f(y) may become 

3x'3z{E{x, x') A E(x', z) A E(y, z)) 

Second, augment the resulting PF-graph formula with a 
conjunct Orphan = 0. This gives the desired x- 

Claim (i) is obvious. To prove (ii), notice that every model 
X of (f gives rise to a function graph Y satisfying x such 
that \X\ = \Y\; in fact, the underlying set of Y is that of 
X. Let ip' be the formula obtained from x by replacing 
every atomic formula E(x, y) with equation f(x) = y; 
clearly <p' is equivalent to (p. Every function graph Y sat- 
isfying x gives rise to a model X' for ip', and therefore 
for ip, such that \ Y\ = \X'\; in fact, the underlying set of 
X' is that of Y. □ 

Fix an arbitrary palette n. 



Proviso 5.2 In the rest of the paper, PF-graphs are of Proposition 5.5 In order to prove the Main Theorem, it 
palette tt. Accordingly, MSO PF- graph formulas use only suffices to prove that the spectrum of any finite theory of 
colors of palette tt. function graphs is eventually periodic. 



According to Definition ^. 91 the Boolean associate 93 (X) 
of a PF-graph X is the finite Boolean algebra with the 
vertices of X as the atoms together with the edge relation 
of X as a binary relation of 93 (X), the colors of X as 
unary relations of 93 (X), and an additional unary relation 
Atomic that consists of the atoms. Let T be the vocab- 
ulary of 93(X). Notice that the vocabulary T does not 
depend on the choice of X. 

Lemma 5.3 For every MSO PF-graph formula \, there is 
a first-order T -formula tp such that, for every PF-graph 
X, we have 

X \= x if and only if 93 (X) |= ip 

Proof Obvious. □ 

According to Definition lzTol the MSO d-theory Th d (X) 
of a PF-graph X is the first-order d-theory i/i d (93(X)) of 
the Boolean associate 93 (X) of X. In the rest of the paper, 
we deal only with MSO d-theories Th d {X) of PF-graphs 
X and never with first-order d-theories th d (X). And so 
the d-theory of a PF-graph X will mean Th d (X). 

By Lemma I2.6I two PF-graphs X and Y have the 
same <i-theory if and only if the duplicator has a win- 
ning strategy in the <i-step Ehrenfeucht-Frai'sse game 
EF d (93(X), 93(F)). Every d-theory t = Th d (X) will 
be called a finite theory of PF-graphs of depth d. If X is 
a function graph then t contains the formula Orphan = 
and so every PF-graph Y with Th d (Y) = t is a function 
graph; in this case we say that t is a finite theory of func- 
tion graphs. 

Definition 5.4 The spectrum of a finite theory t of PF- 
graphs of depth d is the set of the cardinalities of the PF- 
graphs X with Th d (X) = t. The spectrum of a first-order 
T-formula %p is the set of the cardinalities of PF-graphs X 
such that <B(X)\=ip.O 



Proof Assume that the spectrum of any finite theory of 
function graphs is eventually periodic, let tp be any MSO 
formula with one unary function symbol, and let \ be as 
in Lemma l5~TI Every PF-graph satisfying \ is a function 
graph. It suffices to prove that the spectrum of \ is even- 
tually periodic. 

Let ip be as in Lemma 1531 Every PF-graph satisfying x 
is a function graph. It suffices to prove that the spectrum 
of i> is eventually periodic. 

Let d be the quantifier depth of if>. By Corollary 12.51 
there are (/-theories t%, ... ,tj of function graphs such that 
any function graph X \= ip if and only if Th d {X) G 
{ti, . . . , tj}. By Corollarv l3.5l the spectrum of ip of even- 
tually periodic. □ 

Remark 5.6 Since the definable relation Orphan is 
present in the vocabulary of PF-graphs, we should be 
careful with the notion of a substructure of a PF-graph 
X. Consider the following requirement of a subset A of 
X: for every edge E(x, y) of X, if A contains x then it 
contains y. If the requirement is satisfied then the nor- 
mal definition of a substructure of X generated by subset 
A works correctly; otherwise the normal definition does 
not work correctly. We will use the notion of substructure 
only in cases when the normal definition works. □ 

6 Composing Graphs and their 
Theories 

Definition 6.1 The sum X + Y of PF-graphs X, Y is the 
PF-graph that is the disjoint union of X and Y. □ 

Every PF-graph is the sum of its connected components. 

If X = Xi H + X m and Y = Y x + ■ ■ ■ + Y n then 

X + Y = Xi + ■ ■ ■ + X m + Yi + • • • + Y n . If X, Y are 
function graphs then so is X + Y. 



If X and Y are disjoint they will be presumed to be sub- 
structures of X + Y. If X are Y are not disjoint, one or 
both of them should be replaced with isomorphic copies 
in some canonic way. (Of course, we are interested in 
structures only up to their isomorphism type, so it does 
not make any difference how the isomorphic copies are 
chosen. It is convenient though to work with structures 
rather than with their isomorphic types. In particular, it is 
convenient that the components of a PF-graph form a set 
rather than a multiset.) 

Lemma 6.2 Th d (X + Y) is uniquely determined by 
Th d {X) andTh d (Y). 

Proof Use the Ehrenfeucht-Frai'sse games. As far as the 
Duplicator is concerned, the game EF d (X + Y, X' + Y') 
splits into EF d (X, X') and EF d (Y 1 Y'). □ 

The lemma justifies the following operation on finite the- 
ories of the same depth. 

Definition 6.3 Th d (X) + Th d (Y) = Th d (X + Y). □ 

Obviously the operation is commutative and associative. 
For technical reasons, we introduce the notion of dotted 
PF-graphs. 

Definition 6.4 A dotted PF-graph (X, a) is a PF-graph 
X with a distinguished element a, the dot of (X, a). □ 

The Boolean associate 55 (X, a) of (X, a) is the Boolean 
associate 25 (X) of X expanded with a distinguished el- 
ement a. The MSO d-theory Th d (X, a) of a dotted PF- 
graph (X, a) is the first-order <i-theory th d (' ; B(X, a)) of 
the Boolean associate of (X, a). 

Definition 6.5 Let (X, a) and (Y, b) be dotted PF-graphs. 

1 . The dotted sum X+(Y, b) is the non-dotted PF-graph 
Z obtained from X + Y by making b the parent of 
every orphan in X. 

2. The dotted sum (X, a) + (Y, b) is the dotted PF-graph 

(X + (Y,b),a). 



3. The circular sum (X, a) © (Y, b) is the non-dotted 
PF-graph Z obtained from X + Y by making b the 
parent of every orphan in X and making a the parent 
of every orphan in Y. □ 

Lemma 6.6 

1. Th d (X + (Y, b)) is uniquely determined by Th d (X) 
andTh d (Y,b). 

2. Th d ((X, a) + (Y, b)) is uniquely determined by 
Th d (X,a)andTh d {Y,b). 

3. Th d ((X, a) © (Y, b)) is uniquely determined by 
Th d {X, a) andTh d (Y,b). 

Proof Use the Ehrenfeucht-Frai'sse games. □ 

The lemma justifies the following operations on finite the- 
ories of the same depth. 

Definition 6.7 Let (X, a) and (Y, b) be dotted PF-graphs. 

1. Th d (X) + Th d (Y, b) = Th d (X + (Y, b)) 

2. Th d (X, a) + Th d (Y, b) = Th d ((X, a) + (Y, b)) 

3. Th d (X, a)®Th d (Y, b) = Th d {(X, a) © (Y, b)) □ 

The second operation is associative; we will use that fact. 



7 Proof of the Main Theorem 

Let d be an arbitrary natural number. For each e?-theory a 
of function graphs, let p a be the cardinality of the small- 
est function graph with <i-theory a. For each d-theory r 
of dotted PF-graphs, let q T be the cardinality of the small- 
est dotted PF-graph with <i-theory r. Let p be the least 
common multiple of the numbers p a and q T for a and r 
as above. We prove that the spectrum of any d- theory s of 
function graphs is eventually periodic with period p. By 
Proposition l5.5l this implies the Main Theorem. 



So let s be an arbitrary d-theory of function graphs and 
let X be a sufficiently large function graph with d-theory 
s. We construct a function graph Y of cardinality \Y\ = 
\X\ + p such that Th d (Y) = s. The meaning of suf- 
ficiently large will be clarified in the course of the con- 
struction. We consider several cases. 

Case 1: The number of components of X exceeds the 
number of d- theories of function graphs. 

Let X\ , . . . , X m be the components of X in some or- 
der, and let Yi be the sum X\ + ■ ■ ■ + Xi of the first 
i components. We use the fact that the sum operation 
on PF-graphs is associative. By the definition of Case 
1, there exist i < j such that Th d {Yi) = Th d (Yj). 
Let t = Th d {Yi), u = Th d (X i+1 + ■■■ + Xj), and 
v = Th d (X j+ i + • • • + X m ). Then t + u = t and 
t + v = s. Furthermore, t + k ■ u = t for every posi- 
tive k. By the definition of p u , there is a function graph 
Z such that Th d {Z) = u and \Z\ = p u . To obtain the 
desired Y, augment X with k = p/p u components iso- 
morphic to Z. Clearly, \Y\ = \X\ + k-p u = \X\+p. The 
order of the components does not matter of course but it 
is convenient to imagine that the new components come 
after Xj and before Xj + \. Then 

Th d (Y) = t + k-u + v = t + v = s 

Case 2: X has a vertex a such that the number of non- 
cyclic children of a exceeds the number of rf-theories of 
forests. 

Each noncyclic child y of a, together with its descendants, 
forms a tree; let us call it T y . Let F be the forest formed 
by the trees T y , so that the number of the components of 
F exceeds the number of the d-theories for forests. By the 
argument of Case 1 , applied to forests rather than function 
graphs, there exists a forest G of cardinality \G\ = \F\ +p 
such that Th d (G) = Th d (F). 

Let Q be the function graph obtained from X by removing 
the trees T y so that X = F+(Q, a). The desired function 
graph y = G+(Q,a). 

Case 3: X has a cycle of cardinality m that exceeds the 
number of rf-theories of dotted trees. 

Arrange the vertices of the cycle into a path 



do, . . . , a m -i, ao. Let Tj be the tree formed by the 
vertex a, and its non-cyclic descendants. We use 
the fact that the dotted sum operation on dotted PF- 
graphs is associative. For each i G {l,...,m}, let 

(Xi, a ) = (T ,a ) -i i- (Tj_ 1; aj_i), so that (X h a ) 

is a dotted tree with root aj_i. 

By the definition of Case 3, there exist i < j such that 

Th d (Xi,a ) - Th d (Xj,ao). Let (U,*) = {T t ,a t ) + 
••• + (Tj_i,aj_i) and let t = Th d (X l ,a ), u = 
Th d (U,cii) so that t = t + u. Let V be the PF-graph 
obtained from X by removing the tree Xj. (V, aj) is a 
dotted tree with root a TO _i, and X = (Xj,a ) © (V, aj). 
Let v = Th d (V 7 aj) so that s = t®v. 

There exists a dotted tree (Z\ , c\ ) such that 
Th d {Z llCl ) = u and \Z±\ = q u . Let (Z,c) be the 
dotted sum of p/q u copies of (Z\,ci) so that \Z\ = p. 
The desired Y = ((Xj,a ) + {Z, c)) (V, aj). Clearly, 
\Y\ = \X\ + p. Since t + u = t and t © v = s, we have 

Th d (Y) = (t+ (p/q u ) ■u)®V = t®V = S 

Case 4: X has a path ao, . . . , a m _i composed of non- 
cyclic elements such that m exceeds the number of d- 
theories of trees. 

This case is similar to Case 3. Trees Tj and dotted trees 
(Xj,oo) are defined as above but this time we are in- 
terested in undotted trees Xi. By the definition of Case 
4, there exist i < j such that Th d (Xi) = Th d (Xj). 
Let (U, di) = (T u at) + ■ ■ ■ + (Tj-i, aj_i) and let t = 
Th d {Xi), u = Th d (U, di) so that t = t + u. Let V be 
the function graph obtained from X by removing the tree 
Xj. Clearly X = Xj + {V,dj). Let v = Th d (V, aj ) so 
that s = t + v. 

As in Case 3, there is a dotted tree (Z, c) such that \Z\ = p 
and t + Th d (Z, c) = t. The desired Y = Xj + (Z, c) + 
(V, aj ) . Clearly, \Y | = \X \ + p. Further, 

Th d (Y) = {t + Th d (Z, c))+v = t + v = s 

This completes Case 4. 

Finally, let K be the class of function graphs X that do 
not fall into any of the four cases. Since X does not fall 
into Case 1, it has only so many components. Since X 



does not fall into Case 3, the cycle of any component of 
X is only so long. Since X does not fall into cases 2 and 
4, every cyclic element of X has only so many descen- 
dants. It follows that there is a bound 8 on the cardinality 
of any member of K. Thus, the spectrum of s is even- 
tually periodic with period p and p- threshold 6+1. The 
main theorem is proved. 

Remark 7.1 In the original proof sketch, we used the fi- 
nite Ramsey theorem in Cases 3 and 4. Writing up the 
proof we realized that these cases are similar to Case 1 
where only the associativity of the sum operation was 
used. □ 



8 Finite Satisfiability 

Lemma 8.1 There exists a computable function F(d) 
such that, for every d and every function graph X, there 
is a function graph Y of cardinality < F(d) such that 
Th d (X) = Th d (Y). 

Proof The proof is similar to the proof of Proposi- 
tion F° r the reader's convenience, we make it in- 
dependent from Section Q All we have to do is to show 
that every function graph can be constructed from single- 
ton function graphs by means of the tree operations of 
Section|6] Unfortunately this is not quite true. We need to 
revise two aspects of this plan. 

First, it will be easier to deal with a larger class K of 
finite structures: function graphs, forests, dotted forests 
and singleton dotted function graphs. 

Second, we need to extend Definitions 16 . 1 l and 16 . 51 
Definition 8.2 Let {X, a) and (Y, b) be dotted PF-graphs. 

1. X + (Y,b) = ((X + Y),b). 

2. (Y,b)+X = ((X + (Y,b)),b). 

For any d, the d-theories Th d (X + (Y,b)) and 
Th d ((Y, b)+X) are uniquely determined by Th d {X) and 
Th d (Y, b). Now we can carry out our plan. 



Call a K structure good if it is obtained from singleton 
K structures by means of the tree operations. It suffices 
to prove that all K structures are good. By contradiction 
assume that there is a bad K structure and let X be a bad 
K structure of minimal cardinality. Clearly X cannot be 
singleton. To get the desired contradiction, we show that 
X is a composition of K structures of smaller cardinal- 
ity. Clearly X is connected; otherwise it is the sum of its 
components. We have three cases. 

1. A" is a function graph X. The cyclic elements of X 
form a path do, • • • , a m -i, a$. If m = 1, so that there is 
a unique cyclic element, then X = Y + (Z, clq) where Y 
is the forest of the non-cyclic elements of X and Z is the 
singleton function graph containing ao. So m > 1. Let Y 
be the tree formed by ao and its non-cyclic descendants, 
and let Z be the remaining part of X. Then X — (Y, ao)® 
{Z,ai). 

2. X isa tree X. Let b be the root of X, Y be the forest of 
the non-root elements of X, and Z be the singleton forest 
formed by b. Then X = Y + (Z, b). 

3. A" is a dotted tree (X, a). Let b be the root of X, Y 
be the forest of the non-root elements of X and Z be the 
singleton forest formed by b. If a = b then (X, a) = 
(Z, a) + Y. Otherwise {X, a) = (Y, a) + (Z, b) where b 
is the root of X. □ 



Theorem 8.3 The finite satisfiability problem for MSO 
formulas with one unary function symbol is decidable. 



Proof Use Lemma lSTl and the lemma above. □ 
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